Building and Evaluating an Annotated Corpus for Automated Recognition of Chat-Based Social Engineering Attacks

نویسندگان

چکیده

Chat-based Social Engineering (CSE) is widely recognized as a key factor to successful cyber-attacks, especially in small and medium-sized enterprise (SME) environments. Despite the interest preventing CSE attacks, few studies have considered specific features of language used by attackers. This work contributes area early-stage automated attack recognition proposing an approach for building annotating specific-purpose corpus presenting its application domain. The resulting then evaluated training bi-directional long short-term memory (bi-LSTM) neural network purpose named entity (NER). results this study emphasize importance adding plethora metadata dataset provide critical in-context produce that broadens our understanding tactics social engineers. outcomes can be applied dedicated cyber-defence mechanisms utilized protect SME employees using Electronic Medium Communication (EMC) software.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Building an annotated corpus for Amazighe

This paper gives an overview of the morpho-syntactic features of the Amazighe language and corpus encoding, afterwards we present our experience of constructing an annotated corpus with part-of-speech (POS) information. The annotated corpora consist of 20,667 Moroccan Amazighe tokens chosen from different materials; it is to our knowledge the first one dealing with Amazighe language. The experi...

متن کامل

Building an Annotated Corpus for Text Summarization and Question Answering

We describe ongoing work in semi-automatic annotating corpus, with the goal to answer “why” question in question answering system and give a construction of the coherent tree for text summarization. In this paper we present annotation schemas for identifying the discourse relations that hold between the parts of text as well as the particular textual of span that are related via the discourse r...

متن کامل

Building an Annotated Textual Inference Corpus for Motion and Space

This paper presents an approach for building a corpus for the domain of motion and spatial inference using a specific class of verbs. The approach creates a distribution of inference features that maximize the discriminatory power of a system trained on the corpus. The paper addresses the issue of using an existing textual inference system for generating the examples. This enables the corpus an...

متن کامل

MalToBI – Building an Annotated Corpus of Spoken Maltese

Research on the phonetics and phonology of Maltese, and in particular on different aspects of its prosody, is, thus far, rather limited. This is in part due to the lack of structured resources for use in research. One resource which, to date, has been unavailable, is a corpus of spoken Maltese. Such a corpus, could, amongst other things, be used as a ready resource for the analysis of various a...

متن کامل

Web-Based Sources for an Annotated Corpus Building and Composite Proper Name Identification

Nowadays, collections of texts with annotations on several levels are useful resources. Huge efforts are required to develop this resource for languages like Spanish. In this work, we present the initial step, lexical level annotation, for the compilation of an annotated Mexican corpus using Web-based sources. We also describe a method based on heterogeneous knowledge and simple Web-based sourc...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Applied sciences

سال: 2021

ISSN: ['2076-3417']

DOI: https://doi.org/10.3390/app112210871